WORD EQUATIONS IN SIMPLE GROUPS AND 
POLYNOMIAL EQUATIONS IN SIMPLE ALGEBRAS 

on 

ALEXEY KANEL-BELOV, BORIS KUNYAVSKII, EUGENE PLOTKIN 
O 

*■ ^ ■ To Kolya Vavilov, friend and colleague, on the occasion of the 60th anniversary 

^f* Abstract. We give a brief survey of recent results on word maps on simple 

groups and polynomial maps on simple associative and Lie algebras. Our focus 

00 ■ is on parallelism between these theories, allowing one to state many new open 

problems and giving new ways for solving older ones. 

rn ' There were different times: a time 

to throw stones, a time to divide 

and subtract. Now it is a time to 

C^ ' add and multiply. Circumstances 

force us to focus on adding and 
multiplying. 



- * — \ 



From an interview of Mikhail Silin, 

^ , first vice-rector on research of 

£NJ ■ Gubkin Russian State University of 

l/~) ' Oil and Gas to the newspaper 

^^ . "Vestnik of Murmansk", December 

^) • 4, 2009 

f^ | Keywords: word map; simple group; engel words; dominant map; group width; 

£f) • polynomial map. 



1. Introduction 



In this paper we discuss word maps 

w:G d ^G, (1) 

induced on any group G by a group word w — w{x\ , . . . , Xd) in x\, x± , . . . , Xd, X^ 
(= an element of the free group !Fd)- We also consider polynomial maps 

P:A d ^A, (2) 

induced on any associative (or Lie) algebra over a field k (or a ring R) by an 
associative (or Lie) polynomial P = P(Xi, . . . , Xd) in d variables (= an element of 
the free associative (or Lie) algebra). 

Both (P) and © are evaluation maps: one substitutes d-tuples of elements of the 
group G (algebra A) instead of the variables and computes the value by performing 
all group (algebra) operations. We are interested in surjectivity of maps ([1]) and 
(J2), or, more generally, in description of their images. In lowbrow terms, we are 
interested in solvability of equations of the form 

l 
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w(xx,...,x d )=g, (3) 

or 

P(X lt ...,X d ) = M, (4) 

for every right-hand side, or for some "typical" right-hand side, or whether every 
element of the group (algebra) admits a representation as a product (sum) of finite 
number values of the word (polynomial) map, etc. 

This setting is a particular case of the following one. Let be a variety of 
algebras, H be an algebra in 0, W(X) be a free finitely generated algebra in 9 with 
generators xi, . . . ,Xd- Fix w £ W(X) and consider the word map w: H d — > H. 
Varying and H £ Q, we arrive at the problem of solvability of equations in 
different varieties over different algebras. In this note we restrict ourselves to the 
varieties of groups, associative algebras, and Lie algebras. 

Whereas in the group case the theory has been intensely developing and led to 
several spectacular results, including answering some old-standing questions, on the 
ring-theoretic side much less is known, though some new approaches to not less old 
questions have been recently found. Our main goal in this survey, which does not 
pretend to be comprehensive, is to emphasize parallels between the two theories. 
We hope that this may bring cross-fertilization effects in near future. With an eye 
towards such a unification, we put here more questions than answers. 

The interested reader is referred to the monograph [He] and surveys jShlj . [Ni) . 
[BGK] in what concerns the group case. Some references on the algebra case can 
be found in |KBMR] . [BGKPj . We leave aside extremely interesting questions on 
the number of solutions of equations ([3]), Q (or, in other words, on the structure 
of the fibres of maps (JTJ, (J2J). Sec [BGK for an overview of some results in this 
direction. 

2. Value sets 

Given a word map ([1}, it is important to distinguish between the following 
objects: 

• the value set in strict sense: w(G) — {g £ G : 3{gi, ■ ■ ■ ,gd) w (gi, ■ ■ ■ ,9d) = 

ah 

• the symmetrized value set w(G) ± consisting of the elements of w(G) and 
their inverses; 

• the verbal subgroup (w(G)) of G generated by w(G). 

Respectively, given a polynomial map ([2]) of fc-algebras, we distinguish between the 
value set P(A) = {a £ A : 3(oi 5 . . . , ad) -P(«i, • ■ • , o-d) = a } and the vector space 
(P(A)) spanned by P{A) over k. 

It is usually much easier to describe (w(G)) and (P(A)) than the actual value 
sets. 

2.1. Surjectivity. We start with the group case and ask whether word map (Q~]) is 
surjective. More precisely, 

• given a class of groups Q, we want to describe words w for which map ([1]) 
is surjective; 

• given a class of words W, we want to describe groups G for which map ([T]) 
is surjective. 
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In each of these set-ups arising problems range from very easy to extremely 
difficult, depending on the choice of the class. Here are some examples. Let us 
start with the first approach and take Q to be the variety of all groups. Then 
the needed description is given by [Sel Lemma 3.1.1], where such words are called 
universal: they are all of the form w = x ei . . . x^ d w' where w' is a product of 
commutators and gcd(ei, . . . , e^) = 1. Here is a parallel question for associative 
algebras: 

Question 1. What are polynomials P such that the map P: A d — > A is surjective 
for all associative algebras A? 

When we restrict the class Q, we arrive at more interesting and difficult questions. 
Answers heavily depend on this choice. For instance, a theorem of Rhemtulla (see 
[Sel Theorem 3.1.2]) says that if Q consists of the free groups (and maybe also 
of some free products adjoined), then the behaviour of any non-universal word is 
very far from surjectivity: in the terminology explained below, any such word is of 
infinite width for any group of Q. That is why in this survey we prefer to stay away 
from equations in free (and close to free) groups; see, e.g., jCRKj . particularly the 
introduction, for a comprehensive bibliographical survey of vast literature on this 
fascinating theory. 

We mainly focus on another extreme case of simple groups which can also be 
viewed as a building block for some more general theory. Here is our first question. 

Question 2. Let Q be the class of simple groups G of the form G — &(k) where 
k = k is an algebraically closed field and Q is a semisimplc adjoint linear algebraic 
group. Is it true that for all non-power words w ^ 1 word map ([I]) is surjective? 

Remarks 1. (i) If we drop the assumption k — k, or allow G to be not of 

adjoint type (i.e., consider quasisimple groups), there are easy counter- 
examples to surjectivity |My| , |Bo] . On the other hand, if G = SU(n) 
and w(x,y) is any word not belonging to the second derived subgroup of 
.7-2, then the induced word map is surjective for infinitely many n |ETj . 
However, if w does belong to the second derived subgroup of .7-2, it may be 
far from surjective (its image can be arbitrary small in the real topology of 

GOTO. 

(ii) If Q is some infinite family of finite simple groups, then any power word 
w = x n gives rise to the word map which is not surjective for infinitely 
many groups (those whose order is not prime ton). A conjecture of Shalev, 
asserting that such phenomenon may arise only for power maps, turned out 
to be over-optimistic, see [JLOj for a counter-example. For simple algebraic 
groups from Question [51 a power word may not be surjective cither. Sec 
[Stb) . [Chal] - [Cha4] for details, 
(iii) Going over to analogues of Question [2J for finite-dimensional simple asso- 
ciative and Lie algebras over an algebraically closed field, one immediately 
gets obvious obstructions: there are nontrivial (associative and Lie) polyno- 
mials that vanish identically; in addition, an associative polynomial map of 
matrix algebras may take only central or trace-zero values; even apart from 
these obvious obstructions, there are counter-examples to surjectivity, see 
[BGKPj and [KBMRj : if the ground field is not algebraically closed, the sit- 
uation is even more complicated (for example, it is an interesting question 
which of the surjectivity results of jKBMRj . obtained over quadratically 
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closed fields, survive over R). Nevertheless, some of techniques developed 
for associative polynomials may turn out to be useful for attacking Question 
[H see Remarks |3] and 0] below. 

The opposite direction for studying surjectivity is more tricky, even if the class of 
words W consists of only one word. The first non-universal word to be considered 
is commutator. 

2.1.1. Commutator: Ore and beyond. Let w — xyx~ 1 y~ 1 G T<i- The obvious neces- 
sary condition for the surjectivity of map ([1} is {w(G)} = G, i.e., G must be perfect. 
It is very far from being sufficient, even within the class of finite groups, see, e.g., 
[Is] . Making the further assumption that G is simple, one can say something much 
more positive, namely that map (H} is surjective in each of the following cases: 
(i) G is finite (Ore's problem, whose long history has been finished in jLOSTlp : 
(ii) G — Sao, the infinite symmetric group |Orj . [DRj (more generally, any 

non-power word is surjective on S^ |Ly| ) ; 
(iii) G = G(fc), where G is a semisimple adjoint linear algebraic group over an 
algebraically closed field k [Reej (and for many other similar cases, such as 
semisimple Lie groups, etc., see |VWj ): 
(iv) G is the automorphism group of some "nice" topological or combinatorial 
object (e.g., the Cantor set), as listed in [DR; see also [BM . 



In connection with (iv), it is worth quoting I. Rivin (see http : //mathoverf low . 



net/questions/77398/how-did-ores-conjecture-become-a-conjecture): "It 
is a conjecture I attribute to myself, but probably goes back to the ancients, that 
in every reasonable simple group every element is a commutator" . Of course, the 
whole point here is in the word "reasonable" : as any meaningful principle, Rivin's 
principle is subject to breaking, see Section [3^41 for counter-examples. In a positive 
direction, we suggest the following question: 

Question 3. For each of the infinite simple groups listed in |DRJ . is it true that any 
word map is surjective (or, at least, has a "large" image)? (In other words, is it true 
that in each such group one can solve word equations with "generic" right-hand 
side?) 

We also note that the Ore property is not just a standing alone phenomenon in 
group theory. It has many interesting applications in algebraic topology (see [PR] 
and references therein) and birational geometry |Kuj . 



Remarks 2. There are several ways to go beyond Ore. 

(i) First, representing 1 ^ g G G in the form g — [x,y\, one may require to 
make the choice of x as "uniform" as possible. A typical result of such kind 
is a theorem of Gow |Gowj : every finite simple group G of Lie type contains 
a regular semisimple conjugacy class C such that each semisimple g G G 
can be represented in the form g = [x, y] with x G C. A similar result in 
the case where G is a split semisimple Chevalley group over an infinite field 
follows from the prescribed Gauss decomposition [EG . The case where G 
is not split seems open, 
(ii) One can require that x, y satisfy some additional properties. For example, in 
,DR] Ore's theorem on the infinite symmetric group has been strengthened 
by choosing x and y so that they generate a transitive subgroup of Soo. It 
would be interesting to get similar results in the other cases listed above. 
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For instance, one can ask how big can the subgroup (x, y) C G be. Note 
that we cannot guarantee (x, y) — G, there are counter-examples among 
alternating groups (we thank A. Shalev for this remark). 

(iii) Another well-known generalization is Thompson's conjecture asserting the 
existence of a conjugacy class C in every simple group G such that C = G. 
This problem has a positive solution for the split semisimple algebraic 
groups [EGj . However, it is still open for finite simple groups, in spite 
of the major breakthrough made in [EG (see also |YCWj and some more 
recent results (see, e.g., |BGKj for relevant references)). The case of the 
infinite symmetric group has been settled in [Gr] , It is a natural question 
whether there is a "gap" between Ore's and Thompson's conjectures, i.e., 
whether there is a simple group where Ore's conjecture holds and Thomp- 
son's does not. N. Gordeev pointed out that such a group exists: take 
G = Aqo, the finitary alternating group. Then evidently each element of 
G is a commutator but in every fixed conjugacy class C of G any element 
moves a fixed number N of points, hence any product of two elements of 
C moves at most 2N points. Therefore C 2 ^ G. 

(iv) In the case where L is a "classical" Lie algebra (i.e., the Lie algebra of a 
Chevalley group G over a sufficiently large field), most of the statements 
discussed above admit easy analogues when the bracket stands for the Lie 
operation. Say, the analogue of Ore's conjecture has been established in 
[Br] , and an analogue of Gow's theorem can easily be obtained looking at 
the linear map tp x : L — ► L, y i— » [x, y]: its kernel is the centralizer of x, and 
one can conclude that in the representation g = [x, y] the element x can 
be chosen from a fixed G-orbit (with respect to the adjoint action of G on 
L). Further, for the Lie algebras of such kind the analogue of Thompson's 
conjecture also holds true [Gorde] . However, all such generalizations seem 
unknown for non-split simple classical Lie algebras, simple Lie algebras 
of Cartan type in positive characteristic, and for infinite-dimensional Lie 
algebras. The first case to be explored is that of Kac-Moody algebras (as 
well as Kac-Moody groups, where the prescribed Gauss decomposition is 
known [MP] 1. 

(v) Let now A be an associative algebra, and the polynomial under considera- 
tion be the additive commutator P(X,Y) = XY — YX. Let [A, A] denote 
the subalgebra additively generated by the values of P. If A ^ [A A], 
there is no hope for the surjectivity of the map P: A 2 — > A. This happens 
whenever A is finitely generated over Q or, more generally, if A is any (not 
necessarily finitely generated) Pi-algebra [Be], [Mes] . So reasonably inter- 
esting questions only arise for the kernel of the reduced trace map (trace 
zero matrices, for brevity). If A = M n (D), where D is a division algebra 
over a field and n > 2, then every trace zero matrix is an additive commu- 
tator [ARj . If, however, n = 1, the question is open (except for the special 
case of a central division algebra of prime degree over a local field RosJ), 
and it is hard to believe in an easy affirmative answer (that would imply the 
affirmative solution of an old open problem on the cyclicity of any central 
simple algebra of prime degree p over a field of characteristic p, see [Ros] 
for details). From discussions with L. Rowen we got an impression that one 
should rather expect a negative answer, and a counter-example could arise 
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in the algebra of generic matrices (see below). This shows that the ques- 
tions posed in (iv) for non-split simple Lie algebras are probably answered 
in the negative. Note that a too straightforward attempt to extend the 
theorem of Amitsur-Rowen to M n (R), where R is an arbitrary ring, fails 
in general |RR2j . |Ros] . |Mesj . As to infinite-dimensional simple algebras, 
the questions seem to be totally unexplored, to the best of our knowledge. 

2.1.2. Engel words. Apart from the commutator, there are some other interesting 
words for which the surjectivity is known in several cases; see, e.g., |BGKj for some 
relevant references. Here we only want to mention the most natural generalization 
of the commutator, the family of Engel words e n (x, y) = [[[x, y],y] ■ ■ .y\. Their 
surjectivity has been only established for groups G = PSL(2,g) (under some as- 
sumptions on n and q) [EGG] G = S U(n) [ET].G = PSL 2 (C) [H], [BGG] and for 
simple classical Lie algebras pBGKP]. An answer to the following question would 
be a natural continuation of |Ree] . 

Question 4. Let G be a connected semisimple adjoint linear algebraic group over 
an algebraically closed field. Is it true that every Engel word induces a surjective 
map G 2 ^Gl 

2.2. Dominance: the Deligne Sullivan trick and Amitsur's theorem. We 

start with a theorem of Borel |Boj providing a sketch of a somewhat new approach 
(the proof given in [Laj is essentially the same as in the original paper). Our proof is 
based on using the generic division algebra (see |Fo2j for details on this important 
object, including the history of its creation). Let F be a field (for simplicity, 
assumed of characteristic zero), let n be a positive integer, let x\a (1 < i,j < n, k 6 
N) be independent commuting indeterminates. The F-subalgebra of M n (F\x\A) 
generated by the matrices Xk = (a&) is called a ring of generic matrices. Denote 
it by R — F{X}. It is a domain )Am2j . and its ring of fractions Q(R) is a central 
division algebra of dimension n 2 over its centre Z(Q(R)) [Ami] . The centre Z(R) of 
R consists of the central polynomials (and is hence nontrivial for every n [Fol| . [Ra] ). 
The centre C(Q(R)) is the field of quotients of C(R). It is a long-standing open 
problem whether the field C(Q(R)) is a purely transcendental extension of F, see 
the surveys [E5], [LB], [ABGVj . 

Theorem 1. [Boj Let w £ T d (d > 2) be a nontrivial word, and let G be a connected 
semisimple algebraic group over a field F. Then the induced morphism w: G d — > G 
of underlying algebraic varieties is dominant (i.e., its image is Zariski dense). 

Proof. We present a sketch of proof in the case where F is of characteristic zero. 
Several parts of the proof are exactly the same as in the original one. First, as 
dominance is compatible with any extension of the ground field, we may and will 
assume F to be algebraically closed of infinite transcendence degree. Next, we 
may assume that G is simple of type A n (the reduction to the simple case and the 
passage from SL„ to the other types are as in [Bo] ). Further, it is enough to prove 
the dominance for the case where w is a product of commutators. Indeed, suppose 
that the theorem is proven for such words, and let w : G — >• G be an arbitrary word 
map. Consider the map w: G 2d — > G defined as follows: w{x\,y\, .. . ,Xd,yd) '■= 
w([xi,yi], . . . , [xd, yd])- Then the image of w is dense, hence so is the image of w. 
Thus henceforth we assume w 6 [Fd, J~d\- Furthermore, it is enough to prove the 
dominance for the map 

w: (GL„) d ^SL„. (5) 
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Indeed, the image of this map coincides with w(SL n ) because every g e GL„ can 
be replaced with g/ det(g). 

Let us now argue by induction on the rank. The case of rank 1 is treated exactly 
as in |Boj . The key point is the induction step. Assume that map ([5]) is dominant 
for the rank n — 1 and prove that it is dominant for the rank n. As in the original 
proof, it is enough to prove the existence of a matrix C in the image of w such that 
none of its eigenvalues equals 1. 

To this end, it suffices to prove that the image contains a generic matrix with 
this property. Indeed, as soon as this is established, by specialization arguments 
(which are legitimate because of the assumption on the transcendence degree of F) 
we obtain a non-empty Zariski open set of needed matrices. 

To prove the existence of a generic matrix with the required property, assume 
the contrary. Let C denote a generic matrix from the image of w. Since it has an 
eigenvalue 1, the characteristic polynomial pcif) is divisible by t — 1, so pc(t) = 
(t — l)pi(t). The Cayley-Hamilton theorem then gives (C — I n )p\(C) = 0, which is 
in a contradiction with Amitsur's theorem stating that the ring of generic matrices 
is a domain. □ 

Remark 3. In the proof presented above, the core induction argument is based on 
Amitsur's theorem, instead of the Deligne-Sullivan argument used in the original 
proof (which is based on going over to an anisotropic form of G and dates back to 
the unitary trick of Weyl). We believe that this interrelation between matrix groups 
and algebras is very important, and its potential is not exhausted. In particular, 
such an approach may be useful for proving the surjectivity of the word map for 
semisimple groups over algebraically closed fields and, more generally, for getting 
a more precise description of the image of the word map. 

Let w = TT „• cii -\- Consider the values of w on invertible matrices, which will be 
denoted by the letters <Zj, i — 1, . . . , k. The matrix a$ satisfies the Cayley-Hamilton 
equation 

< - 6KK" 1 + • • • + (-l)%(ai)ar l + ■■■ + (-l)^n(o*) = (6) 

where £j is the I characteristic coefficient of the matrix cjj. In particular, £i = tr, 
£„ = dct. In the zero characteristic case, the £/ are expressed via traces of powers. 
Equation (|6|) can be rewritten in the form 

ra-l 

(^(-l)"-'ar i 6-i(ai))det(o i )- 1 = ar 1 . 

Set ip( a i) = Ya=i (~ l) n_i a™ _ £z-i( a i) and rewrite the product w — JJja,^, re- 
placing a~, n with tp{ai(j)) and leaving the factors auj) unchanged. We will obtain 
a polynomial (in the signature enlarged by the characteristic coefficients, in other 
words, a polynomial with forms), which is a product of polynomials in one variable. 
There are some optimistic considerations regarding the study of images of such 
polynomials. Let us look at the construction of homogeneous polynomials whose 
image does not coincide with the whole space [KBMR . Let P be an arbitrary 
matrix polynomial, and let cet. Consider the product 

F = (Ai(P) - cA 2 (P))(A 2 (P) - cAi(P))P = ((1 + c 2 + 2c) det(P) - ctr(P) 2 )P. 
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It vanishes if the ratio of the eigenvalues Aj(P) equals c ±x or they are both zero. 
Therefore the image of F does not contain nonzero matrices with such a ratio of 
eigenvalues, and this construction is essential for the nature of the problem. If 
we are given a nontrivial product of polynomials in one variable in which several 
variables enter, then a requirement on the resulting relation between eigenvalues 
cannot be determined by one factor (if its determinant is not equal to zero). Of 
course, ip is obtained by canceling the determinant, but nevertheless there is some 
ground for optimism regarding making Borel's theorem more precise. 

Remark 4. Let us give an example of a more complicated argument showing the 
existence of a unipotent element in the image of a multilinear polynomial P in the 
space of matrices of second order (assuming it has a noncentral value with nonzero 
trace). 

So let P(X±, . . . , X n ) be a multilinear polynomial, and assume the contrary, i.e., 
P has no unipotent values. Fix all variables except for X\. Then Px 2 ,—,x n (Xi) = 
P(Xi, . . . , X n ) is a linear in X\ function, meeting the discriminant surface at points 
of total multiplicity 2. Since we assume the absence of unipotent values, each such 
point is either scalar or nilpotent. If one point, X±, is a nonzero scalar, and the 
other, Xl, is nilpotent, then at the point X® + X\ we obtain a matrix proportional 
to a unipotent one with some coefficient A ^ 0, and then at the point (X ® + X\)/\ 
we shall obtain a unipotent value. 

Further, if at both points we have a scalar value, then the original value of P 
is scalar, and if both values are nilpotent, then the original value has zero trace. 
Therefore, if the value of P{X\, . . . , X n ) is not scalar and has nonzero trace, then 
while moving any coordinate we observe coincidence of the intersection points with 
the discriminant surface, i.e., tangency to the discriminant surface. 

Then one can extract square root of the discriminant (Ai — A2) 2 . In other 
words, q = Xi(P(Xi, . . . ,X n )) — A2(P(-X"i, . . . ,X n )) is a polynomial in entries of 
Xi, . . . ,X n . The group SL2 acts by simultaneous conjugations on the Xi and is 
connected. Then, under such an action, q is mapped to ±q, and connectedness 
guarantees the invariance of q, by the first fundamental theorem (established in 
|Prj in characteristic zero and in |Do| in positive characteristic). 

Since q is a polynomial in characteristic values of the products of the Xi, so are 
Ai(P), A 2 (P) because Ai(P) + A 2 (P) = tr(P). 

By the Cay ley-Hamilton theorem, (P — Ai (P))(P — A 2 (P)) = which also 
contradicts Amitsur's theorem on zero divisors. Hence P does have unipotent 
values. 

We hope that such kind of reasoning can be helpful in getting a more precise 
description of the image of the word map in the set-up of Borel's theorem. 

Remark 5. Note that an idea of using generic matrices has been recently used, in 
a slightly different context of "universal localization", in |HSVZ1] and [Step] , for 
getting subtle information on commutators in Chevalley groups over rings. 

Remark 6. There are results of Borel's flavour, stating that for some infinite 
groups G any "generic" element g e G falls into the value set of any non-power 
word w [Ml], |DT] . 

Remark 7. In the case of Lie algebras, an analogue of Borel's theorem for Lie poly- 
nomials P which are not identically zero, was established in [BGKPj for semisimple 
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Chevalley algebras (modulo several exceptions over fields of small characteristic), 
under the additional assumption that P is not an identity of sl(2). It is not clear 
whether this assumption can be removed. The answer heavily depends on whether 
one can extend a construction of so-called 3-central polynomials (see, e.g., [Row! 
Theorem 3.2.21]) to the Lie case. (Recall that a polynomial P is called n-central 
is P n is central but P is not.) If such polynomials exist, they provide an example 
of a map which is not dominant. Probably, multilinear Lie polynomials with such 
a property do not exist, and in this case one can drop the assumption mentioned 
above. 

Remark 8. The case of associative algebras is much more tricky. First, one has 
to take into account the obvious obstructions to dominance, and assume that the 
polynomial P is non-central and contains at least one value with nonzero trace. 
Even under these assumptions, there are counter-examples to dominance for maps 
on M<2{K); to avoid them, one has to make additional assumptions on P (say, to 
assume that it is semi-homogeneous). This assumption is not enough for 3x3- 
matrices. One can show that if P is a nonscalar polynomial such that not all its 
values on A = M 3 (K) are 3-scalar or traceless, then its value set P(A) is dense 
in A (moreover, if N is the set of non-diagonalizable matrices, then P(A) n A" is 
dense in N). If P(A) lies in sls(K) and contains a matrix which is not 3-scalar, 
then P(A) is dense in s[ 3 (K). 

If P is assumed multilinear, the main open question is the validity of the Kaplan- 
sky-L'vov conjecture which states that the image of P is either 0, or K, or sl n (K), 
or M n (K). As a first step, we would suggest to look at a downgraded version of 
this conjecture where we weaken it by allowing the image to be a dense subset of 
either sl n (K), or M n (K). As above, one of the key problems is the existence of 
n-central multilinear polynomials. This problem is related to subtle properties of 
division algebras and is not discussed in this survey. See [KBMRj for some details. 

To sum up, we present a list of questions which seem to be a good starting point. 

Question 5. Let P: M 2 {K) d -> M 2 (K). 

(i) If P is multilinear, what are its possible images for K = R? In particular, 

does the Kaplansky-L'vov conjecture hold in this case? 
(ii) Let K be quadratically closed and P be an arbitrary (not necessarily homo- 
geneous) polynomial. Is it true that its value set is either the set {F([x, y})} 
of values of a traceless polynomial, or the whole M 2 (-J4r) (up to Zariski clo- 
sure)? 

For arbitrary n > 3 we ask weaker questions. 

Question 6. Let P: M n (K) d — > M n (K) be a multilinear polynomial. 

(i) Can it be n-central? 

(ii) Suppose that P is not n-central. Is it true that its image is then dense 
provided it contains a matrix with nonzero trace? Can it contain non- 
diagonalizable matrices? 

Similar questions can be asked for traceless polynomials. 

Remark 9. Being even more modest, one can start with multilinear Lie polyno- 
mials, asking similar questions. Even the case of algebras of small rank is open. 
Let us mention an analogue of the Kaplansky-L'vov conjecture. 
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Question 7. Let L be a Chevalley Lie algebra, and let P be a multilinear Lie 
polynomial. Is it true that the image of P on L is either 0, or LI 

Remark 10. Even a further downgrading might be of interest, in view of eventual 
generalizations of Makar-Limanov's Freiheitsatz |MLj to the case of positive char- 
acteristic: given a polynomial map P: M n (K) — > M n (K) such that n 3> degP, 
can one bound from above the codimension of its image by a "reasonable" function 
in n? 

2.3. Words with small image. If G is a finite simple group, the verbal subgroup 
(w(G)} equals either 1 or G. However, for any G one can expect the existence of 
w such that the actual image w(G) is fairly small (though different from 1). Such 
examples were recently constructed in |KN| and |Lej . 

Question 8. (N. Nikolov) |J Let G be a finite simple group, and let X C G be a 
union of conjugacy classes. Does there exist w such that w(G) = XI 

Note that a similar question for polynomials on matrix algebras over finite rings 
was answered in the affirmative in |Chuj . 

3. Width 

3.1. General words and polynomials. In the situation where map ([I]) is not 
surjective (or this is not known), but (w(G)) — G, one can ask how many elements of 
w(G) (or of w(G) ) are needed to represent every element of G. (In the case where 
(w(G)) ^ G, one represents every element of (w(G)).) The smallest such number 
is called the w- width of G. We denote it by wd w (G). If G does not have finite 
iu-width, it is said to be of infinite ly-width. The reader can find comprehensive 
discussions of this notion in [Se], [Ni] (see also [BGKj for a survey of some more 
recent developments in the case where G is a finite simple group). We focus here, 
as above, on parallels with the case of associative algebras. 

First note that as soon as we establish, for some topological group G, any kind of 
Borel's dominance theorem, in the sense that the image of w contains a dense open 
subset, we immediately conclude by a standard argument that wd w (G) < 2. Thus 
this is the case if G is (the group of points of) a connected semisimple algebraic 
group over an algebraically closed field. For arbitrary algebraic groups over fields, 
one can still prove their finite width with respect to any nontrivial word and any 
element of the verbal subgroup jMerj . If, however, we go over to algebraic groups 
over arbitrary rings, the situation changes dramatically, see the next section. 

In the class of finite simple groups, dominance arguments do not make much 
sense. However, the observation made in |La] that word maps still have, in a 
sense, a large image when viewed within an infinite family of finite simple groups, 
gave rise to a series of wonderful results on uniform word width, christened by 
Shalev 'Waring type properties". Making the long story short, we just mention 
the papers [LS], where the existence of such a uniform bound was established, and 
[LSTlj . [LST2| . where, as a culmination of efforts of many people, the uniform 
bounds wd w (G) < 2 (resp. wd w (G) < 3) have been established for all sufficiently 
large finite simple (resp. quasisimple) groups and all words ic^l. See [BGK for 
more references and details. Note that quite recently Shalev with his collaborators 



This question was answered in the affirmative by A. Lubotzky when the paper was in print. 
See: A. Lubotzky, Images of word maps infinite simple groups, arXiv:1211.6575 
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extended many results of this type to some simple algebraic groups over p-adic 
integers, see )Sh2] for a list of relevant questions. In the case of polynomial maps 
of algebras, we propose a notion parallel to word width in the group case. 

Definition 1. Let P(X\, . . . , -X^) be an associative (resp. Lie) polynomial, let A 
(resp. L) be an associative (resp. Lie) algebra over a ring R, let P: A d — > A (resp. 
P: L d — >■ L) denote the induced map, and let V P be the P-module spanned by the 
image of P. If there exists a positive integer m such that every element v G Vp can 
be represented as a sum of m values of P, we call the least m with such property 
the P- width of the algebra and denote it wdp(A) (resp. wdp(L)). Otherwise, we 
say that the algebra is of infinite P-width. 

We are not aware of any general results concerning this notion, beyond those 
that follow from the surjectivity or dominance (see, however, next sections for 
some particular cases). The following question seems conceptually important. 

Question 9. Let k be an infinite field. Let A denote the class of finite-dimensional 
central simple fc-algebras. Let V denote the class of associative polynomials such 
that none of them is central for some A E A. Is it true that all A 6 A are of finite 
P-width for all P € VI If yes, is it true that sup PeV AeA wdp(A) < oo? 

This question makes sense in more restrictive set-ups, when either some class 
of polynomials or some class of algebras, or both, are fixed (and may be narrow 
enough, even consisting of one element, see some examples below). Similar questions 
may be posed for Lie polynomials on finite-dimensional simple Lie algebras. Finally, 
in both cases (associative and Lie) one can also consider infinite-dimensional simple 
algebras, finitely or infinitely generated. All this seems completely unexplored. 
Below we consider a couple of more concrete settings. 

3.2. Commutator width. The case of the width of various groups G with respect 
to w — xyx^ 1 y^ 1 got much attention in the literature. Without pretending to 
giving a comprehensive survey (various aspects are reflected in (Sej, |NI], |HS VZ2] . 
BGK]), we only present some references. 

• As mentioned above, the finite simple groups have commutator width 1. 
As to finite quasi-simple groups, it is at most 2, and all the cases when it 
actually equals 2 are listed [LQST2] . 

• As mentioned above, if G is a Chevalley group over an infinite field, its 
commutator width is at most 2, in view of Borel's theorem. However, this 
breaks down for Chevalley groups over rings, which tend to have very few 
commutators (the type of behaviour called "anti-Ore" in [HSVZlj ). In view 
of examples of groups of infinite commutator width [DVj , we can also call 
it "anti- Waring" . The situation improves if we go over to infinite matrices: 
most Chevalley groups of "infinite rank" return to Waring (if not to Ore) 
behaviour: it is known that their commutator width is at most 2 HS , DVJ; 
similar results were recently obtained for other groups of infinite matrices, 
see |GH] and references therein. This gives rise to a natural question: 

Question 10. Is it true that the groups of commutator width at most 2 listed in 
[DVj are of finite word width with respect to any nontrivial word wl Is the image 
of w "large" (i.e., do we have any analogue of Borel's theorem)? 
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Here are some parallel results and questions for the additive commutator 

P(X,Y) =XY -YX 

in associative algebras. 

• The commutator width of a matrix algebra A — M n (R) over any ring is 
at most two ( |ARj for the case where R is a division algebra, }Ros| for an 
arbitrary commutative ring i?, |Mesj in general). 

• Moreover, in the representation M = [X, Y] + [Z, T] one can fix X and Z 
which are good for every M E A (in [AR only Z was fixed, and this was 
strengthened in |RRlj . [Rosj and |Mesj using genericity arguments). 

This gives rise to the following notion, slightly resembling the notion of one-and- 
a-half-generation of simple groups [Steij , [GK . 

Definition 2. Let A be an associative algebra of commutator width m. If there 
exist Xi, . . . , X s € A such that every M G A can be represented in a form 

M=[X 1 ,Y 1 } + --- + [X.,Y„] + [Z S+1 ,Y S+1 ] +... + [Z m , Y m ], 

and s is maximal with this property, we say that A is of fractional commutator 
width m — 7t- ■ 

Thus the matrix algebras are of fractional commutator width one and a half (the 
result of |ARj gave only one and three-quarters). 

One can define fractional commutator width for groups in a similar fashion and 
ask the following question. 

Question 11. Which of the groups G having commutator width 2 are of fractional 
commutator width one and a half? one and three-quarters? 

One can start with Chevalley groups of infinite rank DV and 14 finite quasisim- 
ple groups of width 2 from the list of JLOST2J . 

One can define fractional width for more general words and polynomials. We 
leave this to the interested reader. 

3.3. Power width. Again, we only briefly quote some results on power width in 
groups. For finite simple groups, the story which started with establishing the 
existence of uniform finite width in MZl , [SWj led to almost conclusive results in 
[GM] where for many powers it was proved that this width equals 2. The uniform 
width for general finite groups was established in |NSj . For infinite groups (e.g., 
for Chevalley groups over rings), the problem is almost unexplored. As in the 
commutator case, some pathologies have been discovered (see the next section). 

For matrix algebras M n (R), we only mention the pioneering paper [Vaj where 
the problem of representing a matrix over a commutative ring as a sum of d th 
powers was considered, and some general results of finite power width flavour were 
obtained (see |KG] for the history of the problem and more references), and |Pu2j 
for some results for matrices over noncommutative rings, in particular, for central 
simple algebras. 

3.4. Monsters. It was unknown for a long time whether there exists a simple group 
of commutator width greater than 1. The first counter-example |BGj appeared in 
the context of symplectic geometry and gave a simple (infinitely generated) group 
of infinite commutator width. Later on, using various contexts, there were con- 
structed simple groups of infinite commutator width which are finitely generated 
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( [Mulj . using small cancellation theory) or finitely presented ( |CFj . among quo- 
tients of Kac-Moody groups) ; among groups appearing in |Mulj , there are those of 
arbitrary large finite commutator width. In |Mu2) . there were constructed simple 
groups of infinite square width (and hence infinite commutator width, because of 
the equality [x,y] — x 2 (x~ l y) 2 (y~ 1 ) 2 , showing that finite commutator width im- 
plies finite square width). We refer the interested reader to |GG] for a walk along 
the zoo of monsters with such anti-Ore and anti- Waring behaviour (and some con- 
ceptual explanations of such phenomena) , which grew up in differential-geometric 
environment, following the spirit of BG]; they are fed up using some advanced 
techniques, including quasi-morphisms and quantum cohomology 

It is an interesting question whether such monsters exist among Lie algebras. The 
case of finite-dimensional algebras of Cartan type over fields of positive characteris- 
tic was already mentioned above. As to infinite-dimensional algebras in characteris- 
tic zero, discussions with E. Zelmanov give a hint that there are infinitely generated 
simple Lie algebras of infinite width and finitely generated ones of arbitrary large 
finite width. Elaborating such examples could give a clue to understanding the 
situation with associative algebras. 

As a final remark, we should mention that apart from Lie algebras, little is known 
on the problems discussed in this paper in the case of non-associative algebras; 
see, however, [Gordo] and [Pulj for some particular results of this flavour. In our 
opinion, it would be interesting to study the image of more general polynomial maps 
for some classical examples, such as Cayley octonions, simple quadratic Jordan 
algebras, simple exceptional Jordan algebras HC3, and the like. 
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